Farsi Characterrecognitionusingnew Hybrid Featureextractionmethods

نویسندگان

  • Fataneh Alavipour
  • Ali Broumandnia
چکیده

Identification of visual words and writings has long been one of the most essential and the most attractive operations in the field of image processing which has been studied since the last few decades and includes security, traffic control, fields of psychology, medicine, and engineering, etc. Previous techniques in the field of identification of visual writings are very similar to each other for the most parts of their analysis, and depending on the needs of the operational field have presented different feature extraction. Changes in style of writing and font and turns of words and other issues are challenges of characters identifying activity. In this study, a system of Persian character identification using independent orthogonal moment that is Zernike Moment and Fourier-Mellin Moment has been used as feature extraction technique. The values of Zernike Moments as characteristics independent of rotation have been used for classification issues in the past and each of their real and imaginary components have been neglected individually and with the phase coefficients, each of them will be changed by rotation. In this study, Zernike and FourierMellin Moments have been investigated to detect Persian characters in noisy and noise-free images. Also, an improvement on the k-Nearest Neighbor (k-NN) classifier is proposed for character recognition. Using the results comparison of the proposed method with current salient methods such as Back Propagation (BP) and Radial Basis Function (RBF) neural networks in terms of feature extraction in words, it has been shown that on the Hoda database, the proposed method reaches an acceptable detection rate (96/5%).

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Hybrid of Rough Neural Networks for Arabic/Farsi Handwriting Recognition

Handwritten character recognition is one of the focused areas of research in the field of Pattern Recognition. In this paper, a hybrid model of rough neural network has been developed for recognizing isolated Arabic/Farsi digital characters. It solves the neural network problems; proneness to overfitting, and the empirical nature of model development using rough sets and the dissimilarity analy...

متن کامل

Multilingual Hybrid Text Processing in Ancient Uighur (Chaghatai) Digitalized System

This research mainly considers and discusses system codepage in special techniques to multilingual processing of ancient Uighur literatures (Chagatai for abbreviation in the following text). Based on detailed analysis to Arabic code page, Farsi codepage and Uighur codepage in Unicode standard, we presented a codepage and keyboard layout, which is compatible with Chaghatai, Arabic, Farsi, Uighur...

متن کامل

A Hybrid Structural/Statistical Classifier for Handwritten Farsi/Arabic Numeral Recognition

In this paper a new Farsi/Arabic numeral recognition system, based on the combination of structural and statistical classifiers, is presented. The structural method cannot deal with broken characters well. A statistical classifier would be more suitable for these unconnected samples. Thanks to the combination of structural and statistical approaches, a complete description of the characters can...

متن کامل

Challenges in Persian Electronic Text Analysis

Farsi, also known as Persian, is the official language of Iran and Tajikistan and one of the two main languages spoken in Afghanistan. Farsi enjoys a unified Arabic script as its writing system. In this paper we briefly introduce the writing standards of Farsi and highlight problems one would face when analyzing Farsi electronic texts, especially during development of Farsi corpora regarding to...

متن کامل

Benchmarking SMT Performance for Farsi Using the TEP++ Corpus

Statistical machine translation (SMT) suffers from various problems which are exacerbated where training data is in short supply. In this paper we address the data sparsity problem in the Farsi (Persian) language and introduce a new parallel corpus, TEP++. Compared to previous results the new dataset is more efficient for Farsi SMT engines and yields better output. In our experiments using TEP+...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014